MACHINE LEARNING FOR TEXT CLASSIFICATION IN BUILDING MANAGEMENT SYSTEMS

نویسندگان

چکیده

In building management systems (BMS), a medium may have between 200 and 1000 sensor points. Their labels need to be translated into naming standard so they can automatically recognised by the BMS platform. The current industrial practices often manually translate these points (this is known as tagging process), which takes around 8 hours for every 100 We introduce an AI-based multi-stage text classification that translates formatted labels. After comparing five different techniques (logistic regression, random forests, XGBoost, multinomial Naive Bayes linear support vector classification), we demonstrate XGBoost top performer with 90.29% of true positives, use prediction confidence filter out false positives. This approach applied in sensors networks various applications, where manual free-text data pre-processing remains cumbersome.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Emotion Detection in Persian Text; A Machine Learning Model

This study aimed to develop a computational model for recognition of emotion in Persian text as a supervised machine learning problem. We considered Pluthchik emotion model as supervised learning criteria and Support Vector Machine (SVM) as baseline classifier. We also used NRC lexicon and contextual features as training data and components of the model. One hundred selected texts including pol...

متن کامل

Text Classification Using Machine Learning Techniques

Automated text classification has been considered as a vital method to manage and process a vast amount of documents in digital forms that are widespread and continuously increasing. In general, text classification plays an important role in information extraction and summarization, text retrieval, and questionanswering. This paper illustrates the text classification process using machine learn...

متن کامل

Applying Machine Learning to Amharic Text Classification

Even though the last years have seen an increasing trend in investigating applying language processing methods to other languages than English, most of the work is still done on very few and mainly European and East-Asian languages. However, there is a need for people all over the World to be able to use their own language when using computers or accessing information on the Internet. This requ...

متن کامل

Machine Teaching: A New Paradigm for Building Machine Learning Systems

The current processes for building machine learning systems require practitioners with deep knowledge of machine learning. This significantly limits the number of machine learning systems that can be created and has led to a mismatch between the demand for machine learning systems and the ability for organizations to build them. We believe that in order to meet this growing demand for machine l...

متن کامل

A Comparative Study of Machine Learning Approaches for Text Classification

Perhaps the single largest data source in the world is the world wide web. Heterogeneous and unstructured nature of the data on web has challenged mining the web. Practical needs to extract textual information and unseen patterns continue to drive the research interest in text mining. Faultless categorization of texts can be better performed by machine learning techniques. In this paper we pres...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of Civil Engineering and Management

سال: 2022

ISSN: ['1392-3730', '1822-3605']

DOI: https://doi.org/10.3846/jcem.2022.16012